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Abstract. We review some techniques from non-linear analysis in order to 
investigate critical paths for the action functional in the calculus of variations 
applied to physics. Our main intention in this regard is to expose precise math- 
ematical conditions for critical paths to be minimum solutions in a variety of 
situations of interest in Physics. Our claim is that, with a few elementary tech- 
niques, a systematic analysis (including the domain for which critical points 
are genuine minima) of non-trivial models is possible. We present specific 
models arising in modern physical theories in order to make clear the ideas 
here exposed. 



1. Introduction 

The calculus of variations is one of the oldest techniques of differential calculus. 
Ever since its creation by Johann and Jakob Bernoulli in 1696-97, to solve the 
problem of the brachistochrone (others solved it, too: Newton, Leibniz, Tschirnhaus 
and L'Hopital, but their methods were different), it has been applied to a variety 
of problems both in pure and applied mathematics. While occupying a central 
place in modern engineering techniques (mainly in control theory, see [9], [38], [47], 
[65j . |75|). it is in physics where its use has been promoted to the highest level, 
that of the basic principle to obtain the equations of motion, both in the dynamics 
of particles and fields: the principle of stationary action (see |5J, [K)J, [32J, [56J). 
Accordingly to that point of view, almost every text on mechanics include a chapter 
on the calculus of variations although, surprisingly enough, the treatment in these 
texts is expeditious and superficial, directly oriented towards the obtention of Euler- 
Lagrange's equations, leaving aside the question of whether the solutions are true 
minima or maxima, despite the importance of this distinction (for instance, while 
in fields such as optics one is interested in the minimal optical length, in stochastic 
dynamics one seeks to maximize the path entropy |71j . On the other hand, while 
the principle of stationary action just selects critical paths, experimentally an actual 
minimum is detected in some systems, see |25|). 

It is interesting to note that the principle was once called the principle of least 
action, although it was soon realized that many physical phenomena does not follow 
a trajectory that realizes a minimum of the action, but just a critical path (the main 
example is the harmonic oscillator, whose trajectory in phase space only minimizes 
the action for a time interval of length which depends on its frequency, see Sec. 
V of [34], which we recommend to get details about the physical meaning of the 
action integral and its extremals). In view of these phenomena, and because the 
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emphasis was on the equations of motion, the elucidation of the true nature of the 
critical paths of the action functional was omitted, and the interest focused on the 
stationary property. However, some recent papers have made a "call to action" (the 
pun is not ours, see [66] . |51| . |34| . |33j). renewing the interest in their extremal 
properties, not only their character of paths rendering the action stationary. 

Our aim in this paper is twofold: on the one hand, to offer a concise, yet rigorous 
and self-contained, overview of some elementary techniques of non-linear analysis 
to investigate the extremals of an action functional. On the other hand, we intend 
to show several non trivial examples of physical interest illustrating the use of these 
techniques. We have avoided the well-known cases, so our examples go beyond 
oscillators and central potentials, and are taken from modern theories, ranging 
from astrophysics (Lane-Emden equations) to relativistic particles with energy dis- 
sipation. Each one of these examples has been chosen to illustrate some particular 
feature. Thus, example 16 . 1 1 shows a Lagrangian for a dissipative system; in example 
16.2.31 as a bonus of the theory developed, we explicitly compute the solution (and 
its zeros) of an equation of the form y" + q{x)'^y = with q(x) rational; example l6.6l 
contains a justification of the use of Lagrange multipliers in the maximum entropy 
principle, etc. 

We also differ from previous works, such as [M] or [33] i in the flavour of the 
treatment: we feel that discussions trying to explain some plain analytic effects in 
physical terms are too lengthy, and sometimes add confusion instead of enlight- 
enment when it comes to explicit computations. Thus, we center our exposition 
around the analytic definition and properties of the Gateaux derivatives of func- 
tionals defined by integration (Lagrange functionals) and the techniques for the 
study of the behaviour of solutions of differential equations such as convexity or 
the comparison theorems of the Sturmian theory. This will be particularly patent 
in section [SJ where we show that the main result in [34] is a direct consequence of 
well-known properties of the zeros of the Jacobi equation (see Proposition 15.11 and 
comments) . 

We offer short proofs for those results that seem to be not common in the physics 
literature. The bibliography, although by no means complete, is somewhat lengthy 
as a result of our efforts to make it useful. 

2. Calculus of variations 

In this section we will briefly describe some basic concepts in the calculus of 
variations in order to set up our notation and conventions, and also in order to 
introduce the Jacobi equation and conjugate points as explicit criteria for a given 
extremal solution to be a minimum. We will start by discussing Gateaux deriva- 
tives and extrema of functionals. For general references on the topics of functional 
analysis and calculus of variations, see ^15]. ^5]. P?]- P]. P], P|. [70]. [75]. 
Note that we deal with the local aspects of the theory, exclusively. There are several 
approaches to the global setting, some of these were developed in [29], [24], [59| : 
more modern versions are developed in |45| . For detailed accounts of the theory 
involved in the global analysis, see [44] and |26[ . 

2.1. Gateaux derivatives. Let (-E, || • ||) be a Banach space, C £' an open 
subset of E and yo € D. Given a functional J : — M, if i) S E' is a non zero 
vector and |t| small enough, yo + tv will lie in D so the following definition makes 
sense. 
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Definition 2.1. Whenever it exists, the limit 

J{yo + tv) - J{yo) 



SJ{yo,v) := lim ■ 



t^o t 

is called the Gateaux derivative (or first variation) of J at yo in the direction v d E. 
This defines a mapping dJ{yQ, •) : i? — >■ R. // this mapping is linear and continuous, 
we denote it by J'iyo) and say that J is Gateaux differentiahle at j/q- Thus, under 
these conditions, SJ{yo,v) — J'{yo){v). Another common notation is dJ{yQ,v) — 
5y„J{v). 

The yo £ D such that J'{yo) = are called critical points of the functional J . 

The extension to higher-order derivatives is immediate. If, for a fixed v € E, 
5J{z, v) exists for every z €i D, we have a mapping D :— ^ R and we can compute its 
Gateaux derivative. Given an yo € D and z,v G E, the second Gateaux derivative 
(or second variation) of J at yo in the directions v and z (in that order) is 

S'j{yo,v,z) := hm SJyo+M-Sy.Jiv) _ 

If 6^J{yo,v,z) exists for any z,v G E, and {v,z) S'^J{yo,v,z) is bihnear and 
continuous, we say that J is twice Gateaux differentiahle at yo G D, and write 
J" {yo) for this mapping. With these notations, we will write 

^yo-^^"") = ^'^J{yo,v) := S'^J{yo,v,v). 

Remark 2.2. For fixed yo (z D and v G E, if we consider the function j(yg,v) ■ 
M — 5- M 6i/ j(yg,v)it) = J{yo + tv), it is obvious that it is defined in some open 
neighborhood ofO,]— s,e[, and the higher-order variations of J are given by 

S-J{yo,v)^j^;l^{Q). 

We will be interested in a particular class of functionals. To introduce it, we first 
need a technical observation: given an open subset J7 C K^, the set 

Du = {y e CH[a,b]) -.yx e [a,b],{x,y{x),y'{x)) e U} 

(the prime denotes derivation, although we will also make free use of the physicist's 
dot notation for derivatives) is evidently contained in the Banach space (C^([a, b]), \\ 
• II), endowed with the norm 

IMI = IMIo + l|y'llo, 

where || • ||o is the supremum norm. Moreover, Du C C^{[a,b]) is an open subset. 
This follows from the fact that for a given yo G Djj the set {x,yo{x),yo{x) : x S 
[a, b]} is compact, so it has an open neighborhood contained in U. 

Definition 2.3. A function L S C^(C/) is called a Lagrangian. To every Lagrangian 
it corresponds a functional J : Du — > M, called its action, defined by 

J{y)=j L{x,y{x),y'{x))Ax. 

J a 

Proposition 2.4. For any U C R'^, the action J is Gateaux differentiable on Du . 

Proof. Let y e Du- Taking into account the remark [2?2l note that for any t e] — e, e[ 
(applying Leibniz's theorem of derivation under the integral): 

4,.)W= / ^^{L{s,y{s)+tv{s),y'{s)+tv'{s)))ds. 
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Evaluating the derivative at < = 0, we get 

5J{y, v) = t {v{s)D^L[s, y{s), y'{s)) + v'{s)D:,L{s, y{s), y'{s))) ds. (1) 



Note that 6J{y,v) is Hnear in v. Moreover, 

\6Jiy,v)\<\\v\\ [ i\D2L{s,yis),y'is))\ + \D3L{s,y{s),y'{smds, 



where the integral exists (because the DiL, i E {1,2}, are continuous on U and 
y, 2/' on [a,b]) and it is a number depending only on y, thus constant for fixed y. 
Then, SJ{y,v) is also continuous in v. □ 

2.2. Local extrema of functionals. 

Definition 2.5. Let J : 13 — > M 6e a functional and let j/o G D. We will say that 
J has a local maximum (local minimum, respectively) in yo if for all y E G, where 
G <Z D is a convex neighborhood of the point yo, it follows that 

J{yo) > J{y) 

(J{ya) < J{y), respectively). 

Theorem 2.6. Let J : D ^ M. be a functional and yo G D. Then: 

(1) (Necessary condition) If J has a local extremal and the variation SJ(yo,v) 
exists for some v £ E, then SJ(yo,v) = 0. Thus, if J is Gateaux differen- 
tiable at yo, J'(yo) = 0. 

(2) (Sufficient condition for a minimum) The functional J has a local minimum 
at yo whenever the following hold: 

(a) For each v E E, 5J{yo,v) — 0. 

(b) (Coercivity) For any y in a convex neighborhood ofyo, the second vari- 
ation S'^J{y,v) exists for each v G E. Moreover, there exists a c > 
such that 

S^J{yo,v)>c\\v\f, 

for all V € E. 

(c) (Weak continuity) Given e > 0, there exists an rj > such that 

\6^Jiy,v)-S^Jiyo,v)\<e\\v\\^, 
for any v Cz E and y satisfying \\y — yo\\ < 

Proof. 

(1) Consider j(yp,«)(i) = J{yo + tv), so if J has a local extremal at yQ, j(yg.v) 
has a local extremum at t = 0. Then, it must be (recall remark 12. 2p 
= fiyo.v)iO)^SJ{yo,v). 

(2) Suppose each of ([2a|) . (j2b|) . ([2c| holds. As before, we have j[yg y-jit) — 
SJ{yo + tv, v) and j'^^^ ^^^(i) = 6^J{yo + tv, v). The hypothesis on the second 
derivatives of J allows us to develop j(yg^v) {t) by Taylor in the interval [0,1], 
and there exists a ^ g]0, 1[ such that 

J{yO + V) - Jiyo) = i(yo,«)(l) - hyo,v)iO) = lj{yo,v)(0- 
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As j"^^ — S'^J{yo + £,v, v), we have the fohowing bound: 
J{yo + v)- J{ya) = 

\5^J{yo,v) + i (^<5V(2/o + ^v, v) - ]^5\j{yQ,v)^ > 

^c||f IP + i (^S^J{yo + iv,v) - ^S^J{yo,v) 

Taking e = c/4 in (Pel) , there exists an ?/ > such that 

\S^J{y,v)-S^J{yo,v)\ < 

for lly — T/oll < V- Choosing now ||w|| < cr]/2, it is ||j/o + S,v — yo\ \ < \\v\\ < 
C77/2, so 



\S^Jiyo + ^v,v)~S^J{yo,v)\ < ^\\v\\^ 



Substituting: 



Jiyo + v) - J{yo) > ^c\\v\\^ - ^c\\v\\^ = ^c\\v\\^ > 0, 
so J has a local minimum. 

□ 

Remark 2.7. Writing c < and reversing the inequality for 6'^ J {yo,v) in (j2bp . we 
get sufficient conditions for a local maximum. 

Remark 2.8. Note that condition (|2ap alone does not guarantee the existence of a 
local minimum, see counter-examples in [53j, §2.10. 

It is interesting to particularize the condition 6J{yo,v) = to the case of an 
action functional. For this, we need a couple of technical results whose proof is 
straightforward, but anyway can be found in any of the references cited at the 
beginning of this section. We will denote Co([a, 6]) = {/ G C''([a, 6]) : f{a) = = 
/(&)}• 

Lemma 2.9 (Lagrange). Let f G C([a, b]) be a continuous real-valued function over 
the interval [a, b] .such that 

b 

f{x)^{x) da; = 
for all fi G Co([a, &]). Then it follows / = 0. 

Lemma 2.10 (DuBois-Reymond). Let f G C([a, 6]) and g G C^([a, 6]) such that 

b 

{f{x)fi{x)+g{x)fi'{x))dx = 

for all fi G CQ([a, &]). Then it follows g' = — /. 

Now, a simple integration by parts in ([1]) . and the application of Lemmas 12.91 



2.101 and Theorem 12.61 leads directly to the following result. 
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Theorem 2.11 (Euler-Lagrange) . Ify(z D is an extremal (maximum or minimum) 
for the action functional J : Z? — > M given as in Definition \2.3\ then y must satisfy 
the Euler-Lagrange equations 

D^Lix, y{x), y\x)) - ^^3^(2^, y{x), y'ix)) = . (2) 

Remark 2.12. In physics literature, it is common to commit a slight abuse of 
notation and to write the Euler-Lagrange equations in the form 

d d d 

—L{x,y{x),y'{x)) - ——L{x,y{x),y'{x)) =0. 
oy ax oy 

Note that, for the case at hand, writing L(s, y{s) + tv{s),y' (s) + tv'{s)) = L{s) 
for simpUcity: 

Hy,v){t) = / {vHs)D22L{s) + 2v{s)v'{s)D2zL{s) + {v'f{s)D^^L{s))ds, 

J a 

SO, evaluating at i — 0, 

5^J{y,v) = f'jv^{s)D22L{s,y{s),y'{s)) (3) 
-^-2v{s)v'(s)D23L{s,y{s),y'{s)) + {v')^{s)D33L{s,y{s),y' {s)))ds. 

It is now a routine computation (continuity arguments and Schwarz inequahty) to 
prove that for an action functional J{y) = L{x,y,y')dx with L E C^([/) such 
that its second partial derivatives are bounded on U, under the hypothesis (|2ap and 
(pbl) of Theorem the condition (gc]) is satisfied ([H], pg. 224). Thus, a path 
ya € D is a minimum if it satisfies Euler-Lagrange's equations ^ and the second 
Gateaux differential at j/o is coercive, that is, there exists a c > such that, for all 
V e E: 

S^J{yo,v)>c\\vW'. (4) 

2.3. Problems with constraints. The calculus of variation is frequently applied 
when there are constraints. The problem can be reduced to that of extremizing a 
single functional constructed out from the original one and the constraints. 

Definition 2.13. Let J be a functional defined on a Banach space {X, \\ ■ ||). We 
say that 5 J is weakly continuous at y € X if: 

(a) The domain of J contains an open neighborhood D 3 y, and, for each h G X, 
the variation 5J{y, h) is defined. 

(b) lim2_j.j, 5J(z, h) — 5J{y, h). 

If there exists an r > such that 6 J is weakly continuous for every z G B(y; r), we 
say that 6 J is locally weakly continuous at y, or simply weakly continuous near y. 

When we have an open subset C/ =]a. 6[x]R x M c K.'^ and an action functional 
J : Du — M, J{y) — L{x,y,y')dx, it is easy to see that imposing some mild 
conditions on the Lagrangian L G C^{U) we obtain a weakly continuous functional. 
For instance, it is enough to require that the second partial derivatives of L be 
bounded on U, or that its first partial derivatives be uniformly continuous. For 
most of the actions appearing in physics, however, it is usually easier to prove the 
weak continuity directly from the definition (cfr. Example 16. 6p . 
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Let now J =: Ko,Ki, ...,Kr be functionals defined on D, all of them Gateaux 
differentiables at each point y G D. We will assume that the set 

S={y€D:Ki{y) = ki,l<i<r} 

is not void, and that yo is an interior point of S such that J\s has a local extremal 
at yo- 

Proposition 2.14. Let 5Kj be weakly continuous near yo, < j < r. Then, for 
any h =: ho, hi, hr € X we have: 

det{SKj{yo, hm)) = 0, 0<j,m<r. 

Proof. By reduction to the absurd. Let us assume that there exist h,hi,...,hr 
such that the determinant is non zero. As yo has an open neighborhood D, there 
exist a set of scalars a, Pi, (3r such that yo + ah + fHihi + ■ ■ ■ prhr & D, and the 

variations SJ, 5Ki, SKr are continuous a,t yo + ah + l3ihi-\ f3rhr. Let us define 

the function F : W+^, on a neighborhood G of the origin (0, 0) e M'^+S 

by 

Fp+i{a,l3i,...,l3r) = Kp{yo + ah + Pihi H PrK), 

for < p < r (remember Ko = J and ho = h). It is immediate that 

Dq+iFp+i{a,l3i,...,Pr) = 5Kp{yo + ah + j3ihi -\ Prhr,hq), 

for < q < r. Now, as 6J,6Ki, ...,SKr are continuous near yo, by shrinking G if 
necessary we can assume F £ C^{G), with Jacobian at the origin: 

det(i5iir-, (?/o, hm)) 7^ 0, 0<j,m<r 

by hypothesis. Applying to F the inverse function theorem in the neighborhood 
of the origin, we get that there exists an open subset V C K''+^, containing 
F{0,...,0) = {J{yo),ki, ...,kr), and a local diffeomorphism ip : V ^ G such that 
G = ifiiV) C G is an open neighborhood of the origin in W~^^, and for all 
{x,yi,...,yr) e V: 

{x,yi,...,yr) = F{ip{x,yi,...,yr)). 

In particular, (p{J{yo),ki, ...,kr) = (0,0,..., 0). As {J{yo),ki, ...^k,.) is a point of 
the open set V C W~^^, we can find in V two different points {xi,ki, kr) and 
{x2,ki, ...,kr) such that Xi < J{yo) < X2- Their corresponding images by f are 
{a^,Pl,...,Pl) = ip{xi,ki,...,kr) and {a"^ , pf, l3^) = ip{x2,ki, ...,kr). Thus, the 
vectors u = yo + a^h + I3\hi + • • • + and v = yo + a^h + (5\hi + • • • + fi'^hr 
belong to D. Moreover, 

F{a\Pl,...,Pl) = {xi,ki,...,kr), 

F{a'',Pl...,pl) = {x2,ki,...,kr), 
so, equating components: 

J{u) = Ko{u) = Fo{a\pl ...,pl) = xi, 
Kp{u) = Fp{u) = Fp{a\pi, ...,pl) = kp, 

for 1 < p < r, and 

J{v) = Ko{v) = Foia^pl = X2, 
Kp{v) = Fp{v) = Fp{a'',pl = kp, 
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SO u,v £ S too. But, because of our choices, J{u) — xi < J(t/o) < X2 ^ 
and this construction can be repeated for xi,X2 arbitrarily close to J{yo), so (by 
the continuity of ip) the corresponding vectors u,v will be arbitrarily close to yo, 
contradicting the assumption that yo is a local extremal of Jjs. □ 

Theorem 2.15. Let 6Kj be weakly continuous near y^. Then, either 

Aet{5K,{yQ,hi))^Q, 1 < ij < r, 
or there exist a set of real numbers (the Lagrange multipliers) Ai, such that 

m 

SJ{yo,h) = ^XidKi{yQ,h). 

Proof. By Proposition I2.14i dci{5Kj{y{),hj)) = 0, < j < r; developing by the 
first column (where Kq = J and ha = h), we get 

m 

SJ{yo,h)dct{6K,{yn,hi)) + "^pL,6K,{yo,h) ^0, (5) 

1=1 

for some set of scalars fii, ...T^r (which depend on the vectors (hi, hr)). 
Then, either 

det{SK,{yo,hi))^0, l<i,l<r, 

or there exist some set of vectors vi, ...,Vr G X such that det{6Ki{yQ,vi)) ^ In 
this case, substituting in ([5]), we obtain 

m 

dJ{yo,h) = ^XiSKi{yo,h), 

1=1 

where 

A,; - - 



detiSK,{yo,hi))' 

□ 

3. Conjugate points 

Let us rewrite the conditions for a minimum of the action J, found in the pre- 
vious section, in terms of a differential equation involving the derivatives of the 
Lagrangian L. 

Proposition 3.1. Let the action J be given as in Definition \2.Sl and let y^ G D. 
Then, the second variation of J atyo, in the direction of av G CQ{[a,b]), 6"^ J(ya,v), 
reads 

S^J{yo,v) = ^j\pv'^ + Qv^)dx, (6) 

where the functions P{x) and Q{x) are explicitly given by 

P{x) = D^^L{x,y{x),y'{x)) (7) 

Q{x) = D22L{x,y{x),y'{x))-^D2Mx,y{x),y'{x)) (8) 

da; 

Proof. Just make an integration by parts in the middle term of the integrand in 
([3]), taking into account the boundary conditions on v G Ca{[a,b]). □ 
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Remark 3.2. The notation used in physics is: 

Q 



d{y'Y ' 

d^L d / d'^L 
dy^ dx \dydy' ^ 

Lagrange considered equation ^ already in 1786. He thought that a suffi- 
cient condition to have a minimum would be the positivity of the second variation 
d^J{yo, v) > (which is not true: coercivity is needed), so he tried to "complete the 
square" in ([6]) by introducing a boundary term of the form gv'^ /2, where g g C([a, b]) 
is to be determined. In this way, we have 

S'J{yo,v) = l\p{v')'+Qv^) dx + J' ^{gv')dx 

{P{v'f + 2gvv' + [g' + Q)v^) dx (9) 



^ P(v' + ^v)\\x + (^g' + Q-^-^v^dx. 



Thus, it is straightforward that 6'^J{yo,v) will be positive definite if the following 
conditions are satisfied 

P = D33L{x,y{x),y'{x))>0, (10) 
g' + Q — '— = has a solution g . (11) 

Condition ([TU|) is known as the Legendre condition. Also, note that equation pT|) 
is of Riccati type. This equation is basic to determine the extremality properties 
of critical points of the action. 

Definition 3.3. Let J be an action functional, and let f G C{[a, b]). The differential 
equation for f 

where P and Q are given in ([7]) and ([5]), respectively, is called the Jacobi equation. 

Notice that Jacobi equation is simply obtained by introducing the change of 
variable g = — Pd(ln/)/da; in equation (fTTj) . which renders it linear. Once we 
solve the equation for /, we get g and then we can assure that, if P > 0, then 
S^J{yo, v) > 0. Although we know that this is not enough to guarantee a minimum 
(recall Remark l2.8p . the properties of the solutions to the Jacobi equation will 
lead us to an equivalent condition for a minimum of the action, given in terms of 
quantities determined by the Lagrangian. 

Definition 3.4. Two points p,q G M (with p < q) are called conjugate with respect 
to the Jacobi equation (|12p if there is a solution f G C^([a, 6]) of (jl2p such that 

/Im^o «^rf/b) = o = ./('?)• 

The following result is just a particular case of K. Friedrichs' inequalities for the 
one-dimensional case (see [1]). Its proof can also be done directly, as an application 
of the Schwarz inequality. 
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Lemma 3.5. For any v G CQ{[a,b]), we have 

\^{x)dx < (\v'f{x)dx 

^ J a 

Theorem 3.6. Let J{y) he an action functional as in Definition \2.S\ Sufficient 
conditions for a critical point yo of J{y) to be a local minimum in the interval [a, b] 
are given by 

(a) For all x £ [a, b] 

P{x)=D33L{x,y{x),y'{x))>0. 

(b) The interval [a,b] does not contain conjugate points at x = a with respect to 
Jacobi equation (|12p . 



Because of the assumptions made on the continuity of the derivatives of L and the 
compactness of [a, 6], we can choose a number a such that < ti < minja f,]{P(x)}. 
Inserting aP{v'Y ~ aP{v')'^ in the equation above, and repeating the computation 



Proof. Recah from PU)) the expression 

5V(yo, = {P{v'f + 2gvv' + {g' + Q)v^) 



in gives 

5^J{Vo,v) = j\p^a){v' + Ax (13) 

+ j^(^g' + Q-Y^-^v^dx + a j^{v'fdx. 

As P{x) > and we have chosen a such that P{x) — a > on [a, b], the first integral 
is positive, as it is the third one. In order to cancel out the second integral, we 
must take a g e C^{[a,b]) such that 

g' + Q- = 0. 

P — (7 



Introducing a function / € C^([a, b\) through 

f 



9^~^{P-<J), (14) 



we arrive at the equation for /: 



^ ^{P-a)^ ) +Q/-0. (15) 



dx \ dx 

By the theorem on the dependence on parameters of the solutions to a second order 
differential equation, the general solution of ([T5|) can be written as f{x,a), with 
/(a;,0) = f{x). Note that, by hypothesis, f{x,0) does not admit points conjugate 
to a in [a,b], so (by continuity), neither does f{x,a) for cr > but close enough 
to 0. If f{x) — f{x,a) is such a solution, by substituting the corresponding g of 
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equation p4|) into ()13| . we get 



5^J[yo,v)= f\p - a) fv' + dx + a j^v'fdx 



>a f {v')^dx. 

J a 



Now, applying Lemma 13.51 

S'Jiyo,v)> HHP 

~^ 2 

The statement follows then from Theorem 12.61 (see also the comments at the end 
of subsectioiil2.2l). □ 



Remark 3.7. We can obtain a criterion for a local maximum just by considering 
the condition P{x) = D'i^L{x^y{x),y' {x)) < and repeating the computations in 
the theorem with the inequalities reversed. 

Remark 3.8. In order to apply the criterion of conjugate points in practice, it is 
desirable to have at our disposal some tools for explicitly computing solutions of 
the Jacobi equation (I12p . An old (but useful) method (^53J pp. 56-57, [21\ pp. 4-2- 
43) is the following: the general solution of Euler-Lagrange's equations (which are 
second order) has the form y = y{x;a,l3), where a, 13 are constants of integration 
( on which the y dependence is differentiable, under some mild conditions. In the 
examples this will be obvious). Then, D2y{x;a, (3) = ^ and D'g,y{x;a, (3) = ^ 
are two independent solutions of the Jacobi equatioi^. We will use this method in 
example 1 6. 2. 3[ 

4. Convex functionals 

In this section we will discuss the particular case of convex Lagrangians. We will 
start by recalling that a subset 5' C M" is said to be convex if for all p,q E S the 
interval [p, q] lies entirely inside of S. This is equivalent to say that 

[p, q] := {p + i(g - p) = + (1 - t)p : < t < 1} c S. (16) 

Definition 4.1. A function f : S CZ M^^ defined over a convex set, is said to be 
convex if 

f{[p,q])<[f{phf{q)] 
or, equivalently, for all t G [0, 1]; 

f{p + t{q - pj) < f{p) + t{f{q) - f{p)) (17) 

Let / : S* — > M be a convex differentiable function. For any t € [0,1], p,q £ S we 
have (|17p . and, on the other hand, by the intermediate value theorem, there exists 
a, Wt G [p,p + t{q — p)] such that 

f{p + t{q - p)) = fip) + td^,J{q - p), (18) 



^The proof is extremely simple: just take derivatives with respect to, say, a in Euler-Lagrange's 
equations (applying the chain rule) and collect terms, taking into account that 

d / d'^L dy\ _ d r a'^L \ dy d'^L dy' 
Ax \ dydy' da ) Ax \ dydy' ) da dydy' da 
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where dwtf denotes the differential of / at wt- From (fT7| and (fT8| we get f{p) + 
id..J(9 -P) < tf{q) + (1 - t).f{p), that is: 

f{p) + d.Jiq-p)<fiq)- 

Taking the hmit t ^ imphes Wt — > p, so: 

/(p)+dp/(g-p) </(g). (19) 

Remark 4.2. There exist well-known criteria (in terms of the Hessian matrix) to 
decide whether a function f : S ^ R is convex or not, see for instance [70J sec. 
10.7. We will apply one such criterion in Example \6.2.1i 

As a straightforward appHcation of these resuhs, we have the following theorem, 
stating that the critical points of an action with a convex Lagrangian are always 
minimals. 

Theorem 4.3. Consider a set U = [a,b]x S C , such that for each fixed x G [a,b] 
the set Sx — {{x,u,v) G U} C is convex. Suppose that for any x G [a^b], 
the Lagrangian function L{x,-,-) : Sx ^ R is convex. Then, any critical path 
yo — yo{x) is a minimal solution, among the paths with the same endpoints, for the 
corresponding action functional J{y) = L{x,y{x),y' {x))dx. 

Proof. The hypothesis of convexity implies, by p9l) . 

L{x, U2,V2) > L{x, Ui,Vi)+ d(^x,ui,vi)L{{u2,V2) - {ui,vi)) (20) 

= L{x, ui,vi) + DiL{x, ui,vi){u2 - ui) + D2L{x, ui, Vi){v2 - vi). 

Now we compute the action on a critical path, yo{x), and an arbitrary nearby one 
y{x), with the same endpoints {yo{a) = y{a) and ya{b) = y{b)), and compare them. 
From (Uni): 

J{y) - J{yo) = I {L{x,y{x),y'{x)) - L{x,yo{x),yQ{x)))dx 



> [ {DiLix,yo,y'o)iy-yo) + D2L{x,yo,y'o)iy' -yo))dx. (21) 

The second term in the last integrand can be -as usual- integrated by parts, we 
then get: 

J{y)-J{yo)> J (DiL{x,yo,yQ) - -^D2L{x,yo,y'o)^ {y ~ yo)dx. 

But, by hypothesis, yo{x) is a critical path; equivalently, for each x e [a,b] it 
satisfies the Euler-Lagrange equations ([2|), and this implies J{y) > J{ya). □ 

Remark 4.4. Reversing the inequalities, we obtain the corresponding result for 
concave functionals. 

5. Sturmian Theory 

Sturmian theory is concerned with the analysis of the zeros that a solution of a 
linear second order differential equation, of the form 

f|+p(:r)^ + 9(x)2/ = 0, (22) 
ax'^ dx 
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(with p{x),q{x) piece-wise continuous) has in a given interval of the independent 
variable. This theory is an invaluable tool to check the properties of critical points, 
as we will see in the examples of the next section. 

The results stated here without proof are well-known (see [2], [SlJ). We write them 
just for easy reference. 

As it is well-known, the differential equation (|22[) may be written, through the 
change of variable v = yexp{^ J pdx), in its normal form 

— +r{x)v = 0, (23) 

where r{x) — q(x) — jp'^{x) — ^p'{x), which clearly preserves the zeros of the 
solutions to ([22l) if p{s)ds is finite for finite x. The first observation is that the 
zeros of such an equation cannot accumulate. 

Proposition 5.1. Let y{x) be a non trivial solution of (|22l) or (|23p . Then, its 
zeros are simple and the set they form does not have accumulation points. Thus, 
on each closed interval [a,b], y{x) only possess a finite number of zeros. 

Proof. If xo were a double zero, y{xo) = = y'{xo), so y{x) would be the trivial 
solution by uniqueness. 

If xq were an accumulation point for the zeros of y{x), there would be a sequence of 
zeros {xn) such that Xn -> xq. By RoUe's theorem, there is a zero of the derivative 
between any two consecutive zeros of the function, so there would be a sequence 
{um) of zeros of y'{x) such that Um ~^ xq. Then, by continuity of both y and y' , 
we would have y{xo) = = y' {x^), which, as we have just seen, is impossible. □ 

As a corollary, the zeros of a non trivial solution of or are: either a finite 
set, or a sequence diverging to -\-oo, or a sequence diverging to —oo, or a sequence 
diverging to ±oo. 

This applies in particular to the Jacobi equation ()12|) . Thus, if we have a solution 
f{x) on the interval [a, b] such that f(a) — 0, its first zero after x = a must be 
located at a point c > a. In other words: there exists a c > a such that there are 
no conjugate points in the interval [a, c]. A direct consequence of this fact is that 
for a short enough interval, critical points of the action J(jj) — L{x, y, y')dx such 
that P{x) = D'i^L{x,y,y') > 0, are local minimizers. 

Some authors state, erroneously, that for any Lagrangian the critical points are 
local minimizers. The origin of the confusion can be traced back to the fact that 
this is true for natural Lagrangianf0, in particular it is true for free Lagrangians 
{V = 0), for which the trajectories are geodesies. However, not every system of 
interest in Physics is natural. Of all the examples presented in this note, only 
Example 16.51 is natural; and in Example 16.61 we present a case for which the critical 
points are maximizers. 

Definition 5.2. // every solution y of (|22p or ([23]) has arbitrarily large (in ab- 
solute value) zeros, then the equation (and all its solutions) are called oscillatory. 
Otherwise, the equation and all of its solutions are called non-oscillatory. 



^That is, those of the form L{y,y') = K{y') — V(y) where ii" is a positive-definite quadratic 
form associated to some metric (usually K{y') = (j/')^/2, that is, the metric is the euclidean one) 
and V is some function. Note that in this case P > 0. 
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Theorem 5.3. Let he yi{x) and y2{x) denote two linear independent solutions of 
equation ([23]). Then, the zeros of both functions are distinct and alternating in 
the sense that yi{x) has a zero between any two consecutive zeros 0/2/2(2;), and 
vice-versa. 

Theorem 5.4. If r{x) < on [a,b], then no non-trivial solution of ([23|) can have 
two zeros on [a,b]. 

Theorem 5.5 (Sturm's Comparison Theorem). Let yi{x) and y2{x) be non-trivial 
solutions to the differential equation (|23p with ri(x) and r2{x), respectively. If 
ri > r2 in a certain interval [a, 6], then yi{x) has at least a zero between two 
consecutive zeros of y2{x), unless yi = j/2 on [a, &]. 

6. Examples 

In this section we will develop some physically motivated examples in order to 
elucidate the ideas analysed so far. In each case, the regularity conditions on the 
Lagrangian are trivially satisfied. Unless otherwise explicitly stated, we will work 
on the space X — C^{[a,b]). Also, in some examples we will follow the notation 
common in physics, taking t as the independent variable and x, x — dx/dt as the 
dependent ones. 

6.1. Driven harmonic oscillator. As it is well known, the differential equation 
for a driven damped harmonic oscillator under a sinusoidal external force is given 
by [30], [36], [41], [62], [M]: 

X + I3x + io^x = siii{iot) . (24) 

Its solutions are well-known. They have the form 

x{t) = ^ sin{ujt -\- ip) , (25) 

UJ/j 

where Z ~ ^J]p~\~^^jJuj^~~l^ is the impedance and (p = arctan ^^/"^2 ^ is the 
phase. 

The corresponding Lagrangian function is: 

Lit, X, X) = ie''* (i^ - _ ^2, 

Here we follow the standard conventions and denote G K as the natural oscil- 
lation frequency, /? g is the damping parameter, and w S K stands for the 
frequency of the driving force. For this Lagrangian, we straightforwardly note that 
the functions P — e^* > and Q = — w^e*^*, given by ([7]) and ([5]), respectively, lead 
to the Jacobi equation 

^ + + (26) 

which is the damped harmonic oscillation equation for the function /. Note that 
this equation is damped by the same amount as the driven equation (j24p . 
This is a second-order linear equation of constant coefficients, so it can be analyt- 
ically solved and we get the general solution (for the underdampec^l case /3 < wo) 



^The other cases (critical damping and overdamping) are treated similarly. 
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in the form 



f{t) = e-- fci sm ^ " + fc2 cos 



where ki,k2 are constants of mtegration. The solution verifying /(O) = can be 
written as 



from which we clearly see that its zeros are located at the values t — , "^^^ . for 



fc e Z. The first zero after i = is located at i = , Z , so the solution (1251) is 

a minimum on the interval [0, — p=^=[. When /3 = this interval becomes [0, — [, 
which is the particular case of the harmonic oscillator (see [34ji Pg- 446). 

6.2. Lane-Emden Equations. The Lane-Emden second-order differential equa- 
tion was originally proposed by Lane ^46j . and studied in detail by Emden [17j and 
Fowler (50] , in order to understand equilibrium configurations of spherical clouds of 
gas (self-gravitating polytropic gas spheres) [7], [T3|, [TS]. Lane-Emden equations 
also appears in several other contexts such as viscous fluid dynamics, radiation, 
condensed matter, relativistic mechanics, and even for systems under chemical re- 
actions (see [74], [28], and references therein for an account of its applications. For 
a mathematical treatment of their zeros, see |14J). The Lane-Emden equation is 
characterized by a non- linear term ?/"(x), where the non- negative parameter n G Z 
(the polytropic index, in its original context) defines the nature of the second-order 
differential equation 



(28) 



dx \ da; 

which may be obtained from the associated Lagrangian 



in the sense that its Euler-Lagrange equations reduce to (l27l) . For this Lagrangian, 
the functions ([7]) and ([8]) are given by P = a;^ and Q = —nx^y'^^^. respectively. As 
in the preceding example, the function P is always positive definite on any interval 
of the form ]0, 6], & G M. We also note that the function Q depends on the parameter 
n. In this way, we obtain the Jacobi equation 

1 d (.df\ ^ 



2 A >3;^j^ +7i2/"-\/ = 0. (29) 
x'^ ax \ ax J 

As the most frequent analytical solutions to the Lane-Emden equation are those 
corresponding to the values ti = 0, 1,5, (T?) (but see [2B] for other cases), we will 
focus next on solution of both, Euler-Lagrange and Jacobi equations, for these 
cases. 
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6.2.1. n=0. Consider any interval [0,6]. For this case, the general solution y{x) of 
Euler-Lagrange's equation ([27| reads 

y{x) = --^ H h fci , 

6 X 

where fci, ^2 are arbitrary integration constants. Note that this function is singular 
at the origin. The physical origin of the problem demands that the solution verify 
y{Q) — 1, ?;'(0) = 0, so we must take /c2 = 0, fci = 1. The solution is then 

x^ 

y(a^) = -y + 1- 

In this very simple case, it is not necessary to deal with the Jacobi equation, as the 
corresponding Lagrangian 

„2 / y 



L{x,y,y') = x^ y— - y 

determines, for each x G [a^h] fixed, a function L{x,-,-) which is convex on the 
convex set M x R, as it has a semi-definite Hessian: for any u, w S K, 

det Hessi(a;, u,v) — Q 
D22L{x,u,v) = 
D33L{x,u,v) =x^ >0. 
Thus, the solution is minimal on ]0.,b], for any 6 > 0. 

6.2.2. n=l. The general solution the Lane-Emden equation (1271) is given by 



, , , sin(a;) , cos(x) 
y{x) = ki + k2 — (30) 

X X 

where fci, fc2 are constants. Again, the physical meaning of the problem imposes 
the condition that the solutions be defined at a; = (where it must be y{0) — 1), 
so we must take fci = 1, fc2 = 0, getting the sine function 

/ A sin(a:) 

y{x) = . (31) 

X 

As the functions P — x^, Q ^ — in this case, Jacobi equation ([2^ results again a 
Lane-Emden equation with n ~ 1 for the function f{x). Hence, for n — I solutions 
of the Jacobi equation and the Lane-Emden equation are of an identical nature. 
Thus, in view of the solutions of the Jacobi equation defined for x = are 

those of the form f{x) = Csinc(a;). They have zeros located at the points x = kn, 
k G Z — {1}. So, on any interval of the form ](fc — l)7r, fc7r[, fc e Z — {0,1}, the 
solutions (I3ip are minimal. A minimum is also obtained on ] — tt, 0[ and ]0, tt[. 

6.2.3. n=5. Analogously, in this case, the general solution to (P7)) reads (see [13], 
[M0): 

V /3 + 3 



"^Although in these references only a 1— parameter family of solutions is given, it is easy to 
trace back the missing parameter a from the calculations presented there (it is fixed at certain 
point to make the output of an integral more manageable). 
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We further note that if we impose the boundary conditions y{Q) — 1 and y'(0) = 
(which set a — 1, (3 = 3) our solution becomes the common one 

We will work on this simplified solution. For this case, the function Q = —bx^y"^, 
thus yielding the Jacobi equation 

dV 2 d/ 45 
— - H -\ 

dx^ X dx (3 + x"^) 

which may be simplified to its normal form (j23p by the change f{x) — u{x)/x: 

d^" 45 

d^+(3T^"^'- (''^ 

Let us apply the method outlined in Remark 13.81 The derivative of the general 
solution with respect to the parameter a, evaluated at the values that give the 
solution we are considering, is 

V3 (x^ ~ 3) 



2^-.± + Tr-^2f = ^^ (34) 



D2L{x-l,?,) 



2(a;2 + 3)' 



It is immediate to check that this is indeed a solution of Jacobi's equation (p4| . 
The derivative with respect to /3 gives nothing new (a multiple of D2L{x; 1, 3)). By 
making the change of variables stated above, we get: 

V3x{x^-3) 

U[X) = 



2(x2 + 3)5 

Note that u(0) = 0. The first (and only) zero of u{x) after a: = is given by a; = -\/3- 
Thus, the solution to the Lane-Emden equation for n = 5 p3p. is a minimum for 
X e [0, V3[. 

6.3. Quantum gravity in one dimension. In modern physics, spin foams mod- 
els have been introduced in order to analyse certain generalizations of path integrals 
appearing in gauge theories. In particular, in quantum gravity the spin foam ap- 
proach has been developed as a tool to understand dynamical issues of the theory by 
the introduction of discretizations describing the metric properties of spacetime [3] . 
To some extent, spin foams for quantum gravity were motivated by a particular 
discretization of general relativity known as Regge calculus |73) . In this context, 
a discrete model for a scalar field representing gravity in one temporal dimension 
has been studied in detail in [35]. Here we present the continuum analogue of this 
model. The general action functional is; 

Jiy)^l J V^){9-Hx)y'ixy+u;y^x))dx, 

where g : R — R is a positive function which acts as the metric on M, and will be 
taken in what follows as g{x) = exp(x), for simplicity. Thus our model Lagrangian 
will be 

L{x,y,y') = lexp{x/2) {exp{^x){y')^ + ojy^) . 
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The Euler-Lagrange equations readily follow: 



uj exp{x/2)y + i exp(-x/2)y' - exp{-x/2)y" = 0, 



or, as exp(— a;/2) > 0, 



y" - \y' - u:e^y = 0. (36) 



By making the change of variable u = exp(a;/2), we can put (|36p in the form 

which, as M = exp(a;/2) > 0, reduces to 

^-4.,(.) = 0. 

This equation is integrated by elementary techniques; its solutions are: 

y{u) = k\ exp(2-\/a;u) + k2 exp(— 2\/wm), 

(with fci, ^2 constants of integration) or, in the original variable x: 

y{x) — ki exp(2Va; exp(a;/2)) + k2 exp(— 2VcJ exp(a;/2)) . (37) 

The coefficients of the Jacobi equation are P = exp(—x/2) > and Q — uj exp{x/2), 
so the Jacobi equation is (after simplifying an exp(a;/2) > factor): 



dx'^ 2 dx 



which has the same form as the Euler-Lagrange equation (I36p (a phenomenon al- 
ready encountered in the case of the Lane-Emden equation for n — 1, recall Example 
I6.2.2[) . Thus, the general solution to the Jacobi equation is 

f{x) — ci exp{2\/uj exp{x/2)) + ci exp(— 2\/a; exp(x/2)), 

with ci , C2 constants of integration which can be fixed by the initial conditions 
/(O) = 0, /'(O) = 1, giving 



e 



Cl = ^ r- ■, C2 



2^ ' 2v^' 

Substituting in the expression for f{x) above, the solution to the Jacobi equation 
can be written as 

fix) = ^ (exp((e-/2 - 1)2^) - exp(-(e-/2 - 1)2^^)) 

= ^sinh(2Vi:;(e"/2-l)), 

which clearly shows that there are no conjugate points to a: = 0. The solutions ([57)) 
are thus minimals on their interval of definition. 
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6.4. Square root Hamiltonian with a dissipation term. Square-root Hamil- 
tonians (or Lagrangians) are a standard feature of many reparametrization invariant 
field theories [TT], [53]. The action of a relativistic particle [TS], [22], [4S], and the 
action of the Nambu string are familiar examples [75j . Further examples of 
physical theories where this sort of Hamiltonians appear include general relativ- 
ity [22], [50], as well as certain approaches to quantum gravity [123 , [50], and also, 
they appear in brane motivated models j4], |58) . Aspects on the quantization of 
these kind of Hamiltonian theories may be found in [5] and |54| , to mention some. 
In this section, we will study the Hamiltonian for a free particle under relativistic 
motion with a linear dissipation term, as proposed in ^31i | . This Hamiltonian reads 



H{p,x,t) :=e^Vl+P'e-'^*, (38) 

where p stands for the canonical momentum associated to the dependent variable 
X — x{t), and 7 is the dissipation term. As discussed in |31| . in the low velocity 
regime, this Hamiltonian reduces to the Caldirola-Kanai Hamiltonian which de- 
scribes the motion of a non- relativistic particle with a linear dissipation term [57] . 
The Lagrangian associated to ([38| is given by 



L{t, X, x) = -e'*y/l - (i;)2 , (39) 
and thus the solution to Euler-Lagrange equation 




is given by 



= (40) 



Aoarcsinh(e-^M^ol)^ (41) 
7 l^ol 

where Aq is an integration constant. The functions (O and ^ are P ~ e'''*/(l — 
(i;)2)^/2 and Q = 0, respectively, and thus the Jacobi equation yields 

d / e'-'* df \ _ 

dt [{i-ixyy/^dtj ^^^^ 

In this last equation, the term x must be understood as the time-depending function 
x{t) = Aoe~^*/\/l~+~A^e~^. Therefore, for this model we are able to explicitly 
find the solution to Jacobi equation (j42|) 

/W=^o-- , '"t 2. ' (43) 

being cq and ci integration constants. We then note that this solution has a unique 

zero at the value t = (I/7) log (^\/ ~ AqJ^Cq/jcq^ , so there are no conjugate 

points for the function f{t), and the solution (j4ip is a minimum for the action on 
any interval [0,b], b E K. 

6.5. Quartic potential model. In this section we will develop a model inspired 
by the static kink of the well-known 0^ model in quantum field theory [55], |42| . 
The model can be resolved both on classical and quantum grounds, and contains 
soliton solutions (see below). In the context of brane theories, the so-called kink 
model also appears by the inclusion of rigidity terms associated to the intrinsic 
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curvature in their effective actions [77], |52) . 
The Lagrangian for our model is 

L{t,x,±)^\{xf + \{x^~'^^\ (44) 

where m and A > are arbitrary real constants. The Euler-Lagrange equation 
associated to this Lagrangian reads 

x-\x (^x"^ - = . (45) 

This equation is of Lenard type: x + f{x)x + g{x) = 0, where / = and g{x) — 
~\x — The change of variables u{x) — x converts it in the first-order 

equation uu' ~ \x — , which is immediately integrated to give 

, = ^ / dt = (46 

where a, 6 G R are integration constants. The solutions commonly found in the 

4 

literature (cited above) are obtained by taking 6 = — 2^ , so to get a perfect square 
in the radicand of (^51) . In this way, the resulting solutions are: 

m , / m(t 
y(i) = ±^tanh' ^ 



Va V V2 

The solution with the plus sign is commonly termed the kink solution, while the one 
with the minus sign is called the anti-kink solution. Both solutions are bounded by 
the values ±to/\/A. In particular, the energy density of the kink solution goes as the 
fourth power of the hyperbolic secant, and is localised within a width characterised 
by the quantity l/m [55]. However, other solutions exist. For instance, we could as 
well take 6 = in (|46| to get (through an obvious change of variable) : 

dx 1 /" dr/ 1 . 

arcsm rj , 



V2 J VAx* - 2m2a;2 mV2 J rj^if - 1 m\/2 
and hence the solution to the Euler-Lagrange equation (|^5l) : 



a;(t) = my ^ sec(m(t - a)). (47) 

In order to get the Jacobi equation (IT^ . we consider the functions P = 1 > and 
Q — iXx^ — rn? , which set the equation for the function f{t): 

d? f 

-4 + (l-6sec^ (m{t- a))) f ^0. (48) 

We can take a = and m = 1 without loss of generality (these are just re-scalings) . 
Then, the Jacobi equation has the form / -I- 4>{t)f = 0, where (j){t) = 1 — sec^ t < 
in the interval ] — 7r/2,7r/2[. At the points t = ±7r/2, the solutions have a blow-up 
and are not defined (so they can not be extended beyond these points). Thus, 
the solutions to the Jacobi equation are defined on ] — 7r/2,7r/2[, do not possess 
conjugate points in this interval (see Theorem 15. 4p and the solution is a true 
minimum on ] — 7r/2, 7r/2[. 
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6.6. Probability density and mcLximal entropy. In this section we implement 
a constrained Lagrangian system related to a probability density function |64) . In 
Bayesian probability theory and in statistical mechanics, this system is related to 
the principle of maximum entropy |40) . which also appears in other branches of 
physics, and in chemistry and biology [8], [43], [49], [71] ■ The model is defined as 
follows. Let Z he a random variable, and let p(x) its associated density function, 
so p : M — ^-JO, +c»[. Suppose that we know the second order momentum 



<J ^ J x^pix)dx, (49) 

and that we want to obtain the least biased density function p{x). This may be 
written as the problem of finding the maximals for the entropy functional (defined 
in terms of the information theory): 



S{p)^- J^p{x) log pix)dx, (50) 
and subject to the constraints 

J p{x)dx = 1, (51) 

x^p{x)dx = cr^ . (52) 



Notice that the Lagrangian here, is defined on U = Rx]0, +cx)[x]R, although its 
dependence on the first and third variables is trivial. 
Thus, we have the three functionals (in the notation of subsection [ 



S{p) = - / pix)logp{x)dx, 
Jm 

Ki{p) = / p{x)dx, 
Jr 

K2{p) — / x^p{x)dx. 
It is immediate to compute the variations: 

SS{p,h) = - [ h{x){l + \ogp{x))dx, 

JR 

6Ki{p,h)= / h{x)dx, 
Jr 

SK2{p,h)— / x'^h{x)dx, 



so it is obvious that they are weakly continuous. Let us apply the theorem 12. 151 on 
Lagrange multipliers. The case det{dKi{y, hi)) = (1 < «, Z < 2), would lead to 

det^ /R^i(^)d^ lRh2{x)dx 
\^ Jg^x'^hi{x)dx J^x'^h2(x)dx 

for arbitrary /ii, /12 G X, or: 

J^x^hi{x)dx ^ J^x'^h2{x)dx 



22 



E. LOPEZ, A. MOLGADO, J. A. VALLEJO 



which is absurd. Thus, we can introduce two Lagrange multiphers Ai, A2 and con- 
sider the Lagrangian 

L{x, p, p) = -p{x) log p{x) + \ip{x) + X2X^p{x). 

The Euler-Lagrange equation yields 

- \ogp{x) - 1 + Ai + X2x'^ ^ 

with solution p{x) = e^^+^i+'^^^^ . Substitution of this solution into the con- 
straints ([5T|) and (|52p implies that the Lagrange multipliers are equal to Ai — 
1 + log ^J- and A2 = —1/(2(7^), respectively. Thus, the solution of the Euler- 
Lagrange equation reads 



Finally, we see that the original Lagrangian Lq{x, p, p') = ^p{x) log p{x), can actu- 
ally be seen as a real function of a single variable on ]0, -|-oo[, for which the second 
derivative Lq{p) = — 1/p is always negative. Therefore, Lq is concave and, due 
to Theorem 14.31 and Remark 14.41 the solution obtained is a (global) maximal in 

]0,+oo[. 

Remark 6.1. A posteriori, we see that the solution we have found, ()53p . belongs 
to C^{M.). However, this is not obvious a priori. Indeed, the method of Lagrange 
multipliers is not the best one to deal with the problem involving higher order mo- 
ments, precisely because the eventual solution may lie outside the space from which 
we start, see [37J. 
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